Detection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition
Authors
Abstract
We have been developing a reliable method of prosodic word boundary detection for Japanese continuous speech, based on statistical modeling of the mora transitions of the fundamental frequency contours of prosodic words. Modifications to the codebook sizes and the HMM topologies improved boundary detection performance. When mora boundary information obtainable from the phoneme recognition process was used, the detection rate reached about 73% with 12.5% insertion errors in speaker-open experiments. The method was then integrated into a continuous speech recognition system with unlimited vocabulary. The integrated system performs recognition in two stages: the first stage detects mora boundaries without prosodic information, and the second stage uses prosodic word boundary information to increase the mora recognition rate. Slight improvements in mora recognition rates were observed in both speaker-closed and speaker-open experiments.
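To make the modeling idea concrete, the following is a minimal sketch of how a mora-by-mora F0 sequence could be quantized with a codebook and scored by a discrete HMM using the forward algorithm. It is not the authors' implementation: the abstract does not give the features, codebook sizes, or HMM topologies, so the two-dimensional feature vectors, the three-entry codebook, the two-state left-to-right model, and the names `quantize` and `forward_loglik` are all assumptions chosen for illustration.

```python
import numpy as np

def quantize(features, codebook):
    """Map each mora-unit feature vector to the index of its nearest codeword."""
    # Pairwise Euclidean distances, shape (num_moras, codebook_size).
    dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :], axis=2)
    return dists.argmin(axis=1)

def forward_loglik(obs, pi, A, B):
    """Log-likelihood of a symbol sequence under a discrete HMM (scaled forward pass)."""
    alpha = pi * B[:, obs[0]]
    scale = alpha.sum()
    loglik = np.log(scale)
    alpha = alpha / scale
    for o in obs[1:]:
        # Propagate through the transition matrix, then weight by the emission.
        alpha = (alpha @ A) * B[:, o]
        scale = alpha.sum()
        loglik += np.log(scale)
        alpha = alpha / scale
    return loglik

# Hypothetical codebook: each row is a prototype mora-unit F0 shape
# (for example, normalized level and slope). Values are made up.
codebook = np.array([[0.0, 1.0], [0.0, 0.0], [1.0, -1.0]])

# Hypothetical features for three consecutive moras.
features = np.array([[0.1, 0.9], [0.0, 0.1], [0.9, -0.8]])
symbols = quantize(features, codebook)

# Hypothetical two-state left-to-right HMM standing in for one prosodic word model.
pi = np.array([1.0, 0.0])
A = np.array([[0.6, 0.4],
              [0.0, 1.0]])
B = np.array([[0.7, 0.2, 0.1],
              [0.1, 0.3, 0.6]])

score = forward_loglik(symbols, pi, A, B)
print(symbols, score)
```

In a boundary detector of this kind, one such score would be computed per candidate model and span, and the best-scoring segmentation of the utterance would determine the hypothesized boundaries. How the actual system does this is not described in the abstract.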
Similar resources
Prosodic word boundary detection using statistical modeling of moraic fundamental frequency contours and its use for continuous speech recognition
A new method for prosodic word boundary detection in continuous speech was developed based on the statistical modeling of moraic transitions of fundamental frequency (F0) contours, formerly proposed by the authors. In the developed method, F0 contours of prosodic words were modeled separately according to the accent types. An input utterance was matched against the models and was divided int...
Continuous Speech Recognition of Japanese Using Prosodic Word Boundaries Detected by Mora Transition Modeling of Fundamental Frequency Contours
An HMM-based method of detecting prosodic word boundaries was developed for Japanese continuous speech and was successfully integrated into a mora-basis continuous speech recognition system with two stages operating without and with prosodic information. The method is based on modeling the fundamental frequency (F0) contour of input speech as transitions of mora-unit F0 contours and operates af...
Representing prosodic words using statistical models of moraic transition of fundamental frequency contours of Japanese
We have formerly proposed a statistical model of moraic transitions of fundamental frequency (F0) contours and showed its effectiveness for prosodic boundary detection and accent type recognition. This model represented F0 contours of prosodic words to simultaneously detect and recognize prosodic word boundaries and accent types. This paper proposes a method where prosodic word F0 contours are m...
Use of Prosodic Features in Speech Recognition
Two methods were proposed for the use of prosodic features in speech recognition: one to detect major syntactic (phrase) boundaries as the initial phase of speech recognition, and the other to check the feasibility of the results of ordinary recognition process from the viewpoint of prosodic features. In the first method, fundamental frequency contours were assumed as waveforms as functions of ti...
Prosodic word boundary detection using mora transition modeling of fundamental frequency contours -speaker independent experiments-
We have been developing a reliable method for prosodic word boundary detection for Japanese continuous speech based on the discrete hidden Markov modeling of fundamental frequency (F0) contours in mora units. Although a favorable result was obtained for the ATR continuous speech corpus as reported already, experiments were done only under closed conditions. This paper reports the results on open and ...
Publication date: 2000